智能论文笔记

Can you recommend content to creatives instead of final consumers? A RecSys based on user's preferred visual styles

Raul Gomez Bruballa , Lauren Burnham-King , Alessandra Sala

分类：计算机视觉 | 机器学习

2022-08-23

由于用户不是最终的内容消费者，因此在内容市场中提供有意义的建议是具有挑战性的。取而代之的是，大多数用户是创意者的兴趣，与他们从事的项目相关，迅速而突然地改变。为了解决向内容创建者推荐图像的具有挑战性的任务，我们设计了一个recsys，以学习视觉样式的偏好，横向用户工作的项目的语义。我们分析了任务的挑战与语义驱动的基于内容的建议，提出评估设置并解释其在全球图像市场中的应用。该技术报告是ACM Recsys '22介绍的论文“学习用户在图像市场中的首选视觉样式”的扩展。

translated by 谷歌翻译

A Machine Learning Case Study for AI-empowered echocardiography of Intensive Care Unit Patients in low- and middle-income countries

Xochicale Miguel , Thwaites Louise , Yacoub Sophie , Pisani Luigi , Tran Huy Nhat Phung , Kerdegari Hamideh , King Andrew , Gomez Alberto

分类：机器学习

2022-12-30

We present a Machine Learning (ML) study case to illustrate the challenges of clinical translation for a real-time AI-empowered echocardiography system with data of ICU patients in LMICs. Such ML case study includes data preparation, curation and labelling from 2D Ultrasound videos of 31 ICU patients in LMICs and model selection, validation and deployment of three thinner neural networks to classify apical four-chamber view. Results of the ML heuristics showed the promising implementation, validation and application of thinner networks to classify 4CV with limited datasets. We conclude this work mentioning the need for (a) datasets to improve diversity of demographics, diseases, and (b) the need of further investigations of thinner models to be run and implemented in low-cost hardware to be clinically translated in the ICU in LMICs. The code and other resources to reproduce this work are available at https://github.com/vital-ultrasound/ai-assisted-echocardiography-for-low-resource-countries.

translated by 谷歌翻译

Democratizing Machine Translation with OPUS-MT

Jörg Tiedemann , Mikko Aulamo , Daria Bakshandaeva , Michele Boggia , Stig-Arne Grönroos , Tommi Nieminen , Alessandro Raganato , Yves Scherrer , Raul Vazquez , Sami Virpioja

分类：自然语言处理

2022-12-04

This paper presents the OPUS ecosystem with a focus on the development of open machine translation models and tools, and their integration into end-user applications, development platforms and professional workflows. We discuss our on-going mission of increasing language coverage and translation quality, and also describe on-going work on the development of modular translation models and speed-optimized compact solutions for real-time translation on regular desktops and small devices.

translated by 谷歌翻译

Out-Of-Distribution Detection Is Not All You Need

Joris Guérin , Kevin Delmas , Raul Sena Ferreira , Jérémie Guiochet

分类：机器学习 | 人工智能 | 计算机视觉

2022-11-29

The usage of deep neural networks in safety-critical systems is limited by our ability to guarantee their correct behavior. Runtime monitors are components aiming to identify unsafe predictions and discard them before they can lead to catastrophic consequences. Several recent works on runtime monitoring have focused on out-of-distribution (OOD) detection, i.e., identifying inputs that are different from the training data. In this work, we argue that OOD detection is not a well-suited framework to design efficient runtime monitors and that it is more relevant to evaluate monitors based on their ability to discard incorrect predictions. We call this setting out-ofmodel-scope detection and discuss the conceptual differences with OOD. We also conduct extensive experiments on popular datasets from the literature to show that studying monitors in the OOD setting can be misleading: 1. very good OOD results can give a false impression of safety, 2. comparison under the OOD setting does not allow identifying the best monitor to detect errors. Finally, we also show that removing erroneous training data samples helps to train better monitors.

translated by 谷歌翻译

edBB-Demo: Biometrics and Behavior Analysis for Online Educational Platforms

Roberto Daza , Aythami Morales , Ruben Tolosana , Luis F. Gomez , Julian Fierrez , Javier Ortega-Garcia

分类：计算机视觉

2022-11-16

We present edBB-Demo, a demonstrator of an AI-powered research platform for student monitoring in remote education. The edBB platform aims to study the challenges associated to user recognition and behavior understanding in digital platforms. This platform has been developed for data collection, acquiring signals from a variety of sensors including keyboard, mouse, webcam, microphone, smartwatch, and an Electroencephalography band. The information captured from the sensors during the student sessions is modelled in a multimodal learning framework. The demonstrator includes: i) Biometric user authentication in an unsupervised environment; ii) Human action recognition based on remote video analysis; iii) Heart rate estimation from webcam video; and iv) Attention level estimation from facial expression analysis.

translated by 谷歌翻译

Understanding the properties and limitations of contrastive learning for Out-of-Distribution detection

Nawid Keshtmand , Raul Santos-Rodriguez , Jonathan Lawry

分类：机器学习 | 计算机视觉

2022-11-06

A recent popular approach to out-of-distribution (OOD) detection is based on a self-supervised learning technique referred to as contrastive learning. There are two main variants of contrastive learning, namely instance and class discrimination, targeting features that can discriminate between different instances for the former, and different classes for the latter. In this paper, we aim to understand the effectiveness and limitation of existing contrastive learning methods for OOD detection. We approach this in 3 ways. First, we systematically study the performance difference between the instance discrimination and supervised contrastive learning variants in different OOD detection settings. Second, we study which in-distribution (ID) classes OOD data tend to be classified into. Finally, we study the spectral decay property of the different contrastive learning approaches and examine how it correlates with OOD detection performance. In scenarios where the ID and OOD datasets are sufficiently different from one another, we see that instance discrimination, in the absence of fine-tuning, is competitive with supervised approaches in OOD detection. We see that OOD samples tend to be classified into classes that have a distribution similar to the distribution of the entire dataset. Furthermore, we show that contrastive learning learns a feature space that contains singular vectors containing several directions with a high variance which can be detrimental or beneficial to OOD detection depending on the inference approach used.

translated by 谷歌翻译

Deep Learning for Rapid Landslide Detection using Synthetic Aperture Radar (SAR) Datacubes

Vanessa Boehm , Wei Ji Leong , Ragini Bal Mahesh , Ioannis Prapas , Edoardo Nemni , Freddie Kalaitzis , Siddha Ganju , Raul Ramos-Pollan

分类：计算机视觉

2022-11-05

With climate change predicted to increase the likelihood of landslide events, there is a growing need for rapid landslide detection technologies that help inform emergency responses. Synthetic Aperture Radar (SAR) is a remote sensing technique that can provide measurements of affected areas independent of weather or lighting conditions. Usage of SAR, however, is hindered by domain knowledge that is necessary for the pre-processing steps and its interpretation requires expert knowledge. We provide simplified, pre-processed, machine-learning ready SAR datacubes for four globally located landslide events obtained from several Sentinel-1 satellite passes before and after a landslide triggering event together with segmentation maps of the landslides. From this dataset, using the Hokkaido, Japan datacube, we study the feasibility of SAR-based landslide detection with supervised deep learning (DL). Our results demonstrate that DL models can be used to detect landslides from SAR data, achieving an Area under the Precision-Recall curve exceeding 0.7. We find that additional satellite visits enhance detection performance, but that early detection is possible when SAR data is combined with terrain information from a digital elevation model. This can be especially useful for time-critical emergency interventions. Code is made publicly available at https://github.com/iprapas/landslide-sar-unet.

translated by 谷歌翻译

Exploring Low Rank Training of Deep Neural Networks

Siddhartha Rao Kamalakara , Acyr Locatelli , Bharat Venkitesh , Jimmy Ba , Yarin Gal , Aidan N. Gomez

分类：机器学习 | (统计)机器学习

2022-09-27

培训低级的深层神经网络，即使用分解层，特别是社区感兴趣的：它在记忆消耗和训练时间方面提供了对未分离培训的效率。先前的工作集中在预训练的网络的低级近似值和低级空间中的培训中，并提供了其他目标，为所选实践提供了各种临时解释。我们分析了在实践中运作良好的技术，并通过对诸如GPT2之类的模型进行广泛的消融，我们提供了证据表明该领域的共同信念，这暗示着令人兴奋的研究机会仍然需要回答。

translated by 谷歌翻译

Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia

Khanh Nguyen , Ali Furkan Biten , Andres Mafla , Lluis Gomez , Dimosthenis Karatzas

分类：计算机视觉

2022-09-21

人类利用先验知识来描述图像，并能够使其解释适应特定的上下文信息，即使在上下文信息和图像不匹配时，也可以在发明合理的解释的范围内。在这项工作中，我们提出了通过整合上下文知识来字幕Wikipedia图像的新颖任务。具体而言，我们制作的模型共同推理了Wikipedia文章，Wikimedia图像及其相关描述以产生上下文化的标题。特别是，可以使用类似的Wikimedia图像来说明不同的文章，并且所产生的标题需要适应特定的上下文，因此使我们能够探索模型的限制以调整标题为不同的上下文信息。该领域中的一个特殊挑战性的任务是处理量不多的单词和命名实体。为了解决这个问题，我们提出了一个预训练目标，掩盖了命名实体建模（MNEM），并表明与基线模型相比，此借口任务可以改善。此外，我们验证了Wikipedia中使用MNEM目标预先训练的模型可以很好地推广到新闻字幕数据集。此外，我们根据字幕任务的难度定义了两种不同的测试拆分。我们提供有关每种方式的作用和重要性的见解，并突出我们模型的局限性。接受时，代码，模型和数据拆分可公开可用。

translated by 谷歌翻译

Documenting use cases in the affective computing domain using Unified Modeling Language

Isabelle Hupont , Emilia Gomez

分类：人工智能

2022-09-19

对AI的道德影响和值得信赖系统的设计的研究需要分析使用AI系统的方案，这与“用例”的软件工程概念和“预期目的”法律术语有关。但是，没有用于涵盖使用，范围，功能要求和AI系统风险的用例文档的标准方法。在这项工作中，我们为AI用例提出了一种新颖的文档方法，特别关注情感计算领域。我们的方法基于对研究文献中记录的用例信息需求的评估以及最近提议的AI欧洲监管框架。通过此评估，我们采用并调整了统一的建模语言（UML），在过去的二十年中，这主要由软件工程师使用。然后，每个用例都由UML图和一个结构化表表示，我们提供了一组示例，说明了其在几种情感计算方案中的应用。

translated by 谷歌翻译